Pronouncing Text by Analogy
نویسندگان
چکیده
Pronunciation-by-analogy (PbA) is an emerging technique for text-phoneme conversion based on a psychological model of reading aloud. This paper explores the impact of certain basic implementational choices on the performance of various PbA models. These have been tested on their ability to pronounce sets of short pseudowords previously used in similar studies, as well as lexical words temporarily removed from the dictionary. Best results of 85.7% and 67.9% words correct are obtained lor the pseudowords and lexical words respectively, casting doubt on certain previous-reported performance figures in the literature.
منابع مشابه
Evaluating the Pronunciation Component of Text-to-Speech Systems for English: A Performance Comparison
The automatic derivation of word pronunciations from input text is a central task for any text-to-speech system. For general English text at least, this is often thought to be a solved problem, with manually-derived linguistic rules assumed capable of handling ‘novel’ words missing from the system dictionary. Data-driven methods, based on machine learning of the regularities implicit in a large...
متن کاملEvaluating the pronunciation component of text-to-speech systems for English: a performance comparison of different approaches
The automatic derivation of word pronunciations from input text is a central task for any text-to-speech system. For general English text at least, this is often thought to be a solved problem, with manually-derived linguistic rules assumed capable of handling “novel” words missing from the system dictionary. Data-driven methods, based on machine learning of the regularities implicit in a large...
متن کاملPronouncing unknown words using multi-dimensional analogies
In this paper, a model of analogy-based learning is presented, whose main novelty is the crucial ability to produce analogies in multi-dimensional input and output spaces. Evaluations are performed on various word pronunciation tasks, revealing the effectiveness of such joint learning strategies.
متن کاملTwo Database Resources for Processing Social Media English Text
This research focuses on text processing in the sphere of English-language social media. We introduce two database resources. The first, CECS (Casual English Conversion System) database, a lexicon-type resource of 1,255 entries, was constructed for use in our experimental system for the automated normalization of casual, irregularly-formed English used in communications such as Twitter. Our rul...
متن کاملPronunciation dependent language models
Speech recognition systems are conventionally broken up into phonemic acoustic models, pronouncing dictionaries in terms of the phonemic units in the acoustic model and language models in terms of lexical units from the pronouncing dictionary. Here we explore a new method for incorporating pronunciation probabilities into recognition systems by moving them from the pronouncing lexicon into the ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1996